Synonymous Codon Substitution Matrices
نویسندگان
چکیده
Observing differences between DNA or protein sequences and estimating the true amount of substitutions from them is a prominent problem in molecular evolution as many analyses are based on distance measures between biological sequences. Since the relationship between the observed and the actual amount of mutations is very complex, more than four decades of research have been spent to improve molecular distance measures. In this article we present a method called SynPAM which can be used to estimate the amount of synonymous change between sequences of coding DNA. The method is novel in that it is based on an empirical model of codon evolution and that it uses a maximum-likelihood formalism to measure synonymous change in terms of codon substitutions, while reducing the need for assumptions about DNA evolution to an absolute minimum.We compared the SynPAMmethod with two established methods for measuring synonymous sequence divergence. Our results suggest that this new method not only shows less variance, but is also able to capture weaker phylogenetic signals than the other methods.
منابع مشابه
The rate of synonymous substitution in enterobacterial genes is inversely related to codon usage bias.
Genes sequences from Escherichia coli, Salmonella typhimurium, and other members of the Enterobacteriaceae show a negative correlation between the degree of synonymous-codon usage bias and the rate of nucleotide substitution at synonymous sites. In particular, very highly expressed genes have very biased codon usage and accumulate synonymous substitutions very slowly. In contrast, there is litt...
متن کاملA combined empirical and mechanistic codon model.
The evolutionary selection forces acting on a protein are commonly inferred using evolutionary codon models by contrasting the rate of synonymous to nonsynonymous substitutions. Most widely used models are based on theoretical assumptions and ignore the empirical observation that distinct amino acids differ in their replacement rates. In this paper, we develop a general method that allows assim...
متن کاملThe problem of counting sites in the estimation of the synonymous and nonsynonymous substitution rates: implications for the correlation between the synonymous substitution rate and codon usage bias.
Most methods for estimating the rate of synonymous and nonsynonymous substitution per site define a site as a mutational opportunity: the proportion of sites that are synonymous is equal to the proportion of mutations that would be synonymous under the model of evolution being considered. Here we demonstrate that this definition of a site can give misleading results and that a physical definiti...
متن کاملRates of synonymous substitution do not indicate selective constraints on the codon use of the plant psbA gene.
The psbA gene of the flowering plant chloroplast genome has a pattern of codon bias that differs from all other angiosperm chloroplast genes. In psbA, unlike all other chloroplast genes, the third-codon-position composition does not reflect the general genome compositional bias of a high A+T content. Instead, in specific synonymous groups, the codon use of psbA more closely corresponds to the t...
متن کاملNucleotide substitution pattern in rice paralogues: implication for negative correlation between the synonymous substitution rate and codon usage bias.
Understanding the correlation between synonymous substitution rate and GC content is essential to decipher the gene evolution. However, it has been controversial on their relationship. We analyzed the GC content and synonymous substitution rate in 1092 paralogues produced by two large-scale duplication events in the rice genome. According to the GC content at the third codon sites (GC3), the pa...
متن کامل